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A trade-off relation on our knowledge about two noncommuting observables of a qubit system in 
simultaneous measurement is formulated. The obtained inequality offers a quantitative information- 
theoretic representation of Bohr's principle of complementarity, and can be interpreted as a trade- 
off relation on the asymptotic accuracy of the maximum-likelihood estimation of the probability 
distributions of observables. 
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Quantum mechanics features two distinct kinds of un- 
certainty. One is the quantum fluctuations inherent in 
a measured system, and the other is the noise caused 
by the process of measurement. Quantum fluctuations 
prevent us from knowing a quantum system beyond the 
probability distribution of the measured observable 
but the probability distribution itself can be accurately 
determined by means of an appropriate projection mea- 
surement. The uncertainty relation between two non- 
commuting observables such as the position and the mo- 
mentum originates from those fluctuations 
the other hand, the noise places a limit on the accuracy 
of simultaneous measurement. It is known, for example, 
that simultaneous measurements of two noncommuting 
observables implies that at least one of them cannot be 
measured without incurring a measurement error [§] . De- 
spite a long history of study [1, 0, H, , however, the fun- 
damental limit to simultaneous measurement of two non- 
commuting observables has yet to be fully understood. 

A classic analysis of the problem was given by 
Arthurs et al. [To . [TlT ] who, under a special condition 
called unbiasedness, have shown that the lower bound of 
the uncertainty product of canonically conjugate observ- 
ables is twice as large as the standard lower bound of fi/2, 
where H is the Planck constant divided by 2tt. The under- 
lying physics behind this doubling of the lower bound is 
that, under the condition of unbiasedness, fluctuations of 
a system's observable and the noise generated in the mea- 
surement process become uncorrelated and that they sim- 
ply add up. More recently, a number of studies on related 
problems are conducted without invoking the unbiased- 
ness condition and various uncertaint y r elations are de- 
rived iisiiilillSlillSliEliilil^l^liilSlS- 

In this paper, we derive a trade-off relation concern- 
ing the measurement accuracy of two noncommuting ob- 
servables for a qubit system by considering nonideal joint 
measurements. We also show that our characterization of 
the measurement accuracy is closely related to the Fisher 
information [24, 25] , which provides the asymptotic accu- 
racy of the maximum- likelihood estimation of the proba- 
bility distribution of an observable for a finite number of 
samples. In reality, only a finite number of samples are 
available [261 ]. which give us only an imperfect informa- 



tion about the probability distribution of an observable 
for an unknown state. The crucial observation made in 
this paper is that this imperfection is further deteriorated 
in the case of simultaneous measurement due to noncom- 
mutability of the observables. 

We first formulate a simultaneous measurement on a 
qubit system with respect to two observables A — tia ■ cr 
and B = ng where three-dimensional unit vectors 
tia and tib indicate the directions of the measurements 
with riA ■ n,B = cos# (0 < 6 < n), and <r = (a x , fr y , & z ) 
represents of the Pauli matrices. Since both A and 
B have eigenvalues of ±1, they can be represented as 
A = P A {+) - P A (-) and B = P B {+) - P B (-), where 
P A (B) (i) are projection operators corresponding to ob- 
servable A (B). We denote as p A {B){i) the probability 
distribution of A(B) given by 

Pa {i)=tr(pP a (i)) (a = A,B), (1) 

where p is the density operator of the measured qubit 
system. Each simultaneous measurement on the qubit 
system should yield a pair of outcomes (i, j) (i, j = ±) for 
observables A and B, because their eigenvalues are ±1. 
We characterize the probability distributions of obtaining 
with a positive operator-valued measure (POVM) 
{E(i,j)} EaUi, where E itj E(i,j) = l with I being 
the identity operator. The probability distribution q(i,j) 
of obtaining an outcome is given by 

q(i,j) = tx(pE(i,j)), (2) 
and the marginal POVMs are expressed by 

E A (i) = J2 E(i,3), E B (j)= J2 Mid)- (3) 
j=+- «=+,- 

The four positive operators E(i,j) are, in general, ex- 
pressed as 

E(i,j) = r ij I + x ij -&, (4) 

The requirements that the sum of the four operators 
equals the identity operator and that all of them be non- 
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negative are met if and only if 

i,j id 

The marginal POVMs are also expressed as 

E a (+) = rj + x a -&, E a (-) = 1- E a (+), (6) 

where r A = r ++ + r + _, r B = r ++ + r_+, x A = 

x ++ + , and xb = x ++ + x |_. The probability 

distributions of measurement outcomes are given by 

q a (i)=tT(pE a (i)). (7) 

In general, <m(*) and q B (i) do not respectively coincide 
withp^z) andps(z), because the simultaneous measure- 
ment entails a measurement error. 

In the following, we consider a class of simultaneous 
measurements called nonideal joint measurement [l5l.[l7T|. 
A simultaneous measurement A and B belonging to this 
class satisfies the condition that 

E a (i) = (FchPoti), (8) 
j=+- 

where Fa and Fb are two-dimensional square matrices: 

a ~ \ l-r aT \x a \ l-r a ±\x a \ )■ W 

It can be shown that xaZ/tia and xs/fris- Moreover, 
we can show that Fa and Fg are transition-probability 
matrices, or stochastic matrices, satisfying 

< |detF Q | < 1, |detF Q | = 2\x a \. (10) 

Note that the nonideality of measurement is character- 
ized only by the transition-probability matrix in the case 
of a nonideal measurement. A nonideal measurement 
expressed by Eq. ([5]) is formally interpreted as a mea- 
surement of A(B) to which classical noise, characterized 
by a transition-probability matrix or a noisy transmis- 
sion channel, is added. However, in general, the noise 
in Eq. ([5]) arises from interactions described by quantum 
mechanics. 

Let us now quantify the accuracy of measurement. Our 
purpose is to characterize the accuracy in such a manner 
that it depends only on the process of measurement and 
not on measured state p. To do this, we focus on the 
transition- probability matrix F a , which characterize the 
noise caused by the measurement process, and define 

X A = (detF A ) 2 , X B = (defies) 2 , (11) 

where < X A < 1 and < X B < 1. We shall refer to 
^A(B) as the accuracy of measurement for A (B). 

When Xa or Xg equals 1, the measurement of A or 
that of B can be shown to be a projection measurement. 
On the other hand, when Xa or X B equals 0, all of the 



operators in {Ea{i)} or {E B (i)} are proportional to the 
identity operator and therefore no information about A 
or B can be obtained from the measurement. Note that 
achieving both Xa = 1 and X B = 1 is impossible for 
simultaneous measurement, because a measurement error 
is unavoidable in at least one of the two noncommuting 
observables. In fact, we can derive a stronger trade-off 
relation between Xa and Xb ■ 

X A + Xb - X A X B cos 2 e < 1, (12) 

where 9 = cos~ 1 (tia ■ n B ). A similar inequality is ob- 
tained in Ref. for a special case of r a — tb = 1/2. 
The accessible regime for Xa and Xg is illustrated in 
FIG.l for the case of 9 = ir/6. For example, Xa — > 1 can 
be achieved only when Xg — * 0, indicating that when 
we measure one observable without any measurement er- 
ror, we cannot obtain any information about the other 
observable. Trade-off relation (fT2)) between Xa and Xg 
implies an upper bound on our knowledge about non- 
commuting observables for a qubit system. 




FIG. 1: (Color) Trade-off relation on the accuracy of noncom- 
muting observables. P and Q indicate the regimes satisfying 
inequality (|12[) for the case of 6 = tt/6. The regime Q is 
accessible only through simultaneous measurement. 

The derivation of inequality (fT2")l goes as follows. From 
Eqs. (O, we have 

\x A + x B -y\< 2r ++ , \x A ~x B + y\< 2r + _, 
\x A - x B - y\ < 2r_ + , \x A + x B +y\< 2r — , 

where y = x^ + x |_. By using (fl3|) and the triangle 

inequality, we obtain \xa + xb\ + \xa ~ x B \ < 1. Noting 
that jdetF^I = 2 1 | , |detFe| = 2\x B \ and xa ■ xb = 
\xa\\x b \ cos 9, we obtain (fT2")l . 

We give an alternative expression of inequal- 
ity (TJJD [12|. Let us define £ A = {l/X A ) - 1 and 
£b = (l/X B ) — 1, where < £a < oo and < £b < oo. 
Here, £a — implies Xa = 1, that is, the measurement 
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of observable A involves no measurement error. Hence, 
we can interpret £a and £b as parameters characteriz- 
ing measurement errors. In terms of £a and 8b, we can 
express (fT2l as 



£a£b > sin 9. 



(14) 



This inequality manifestly shows that a trade-off rela- 
tion exists between measurement errors £a and £b for 
simultaneous measurement if the corresponding observ- 
ables are noncommuting (i.e. 8^0). 

An optimal simultaneous measurement that satisfies 
the equality in (fl2|) with Xa = Xb is achieved when 
x ++ + x — = 0, a; + _ + x- + = 0, \x ++ \ = r ++ = r — , 

|sg_I | = r_| = r |-, and r ++ + = 1/2. The 

corresponding POVM {E(i,j)} is given by E(i,j) = 



\x.ij \I + Xij • a with 



x ++ = r(n A + n B ), X— 
x + - = r(n A - n B ), x +H 



'+-■> 



(15) 



where r = ± (\tia + «b| + \tia — n s|)~ /2. These pos- 
itive operators are proportional to projection operators, 
and the directions of the projection are mutually orthog- 
onal [H]. 

Inequalities (fT2"|) and (|14[) offer a rigorous representa- 
tion of Bohr's principle of complementarity. According to 
his classic paper 8] , the essence of the principle of com- 
plementarity is "the mutual exclusion of any two experi- 
mental procedures" when measuring two noncommuting 
observables simultaneously. Inequalities (fT2|) and (jT4j) 
quantitatively represent this incompatibility. 

We next point out a close connection between the accu- 
racy of measurement X a and the asymptotic accuracy of 
the maximum-likelihood estimation of p a (i)- Let us con- 
sider simultaneous measurements for each of N (< oo) 
samples prepared in the same unknown state p. The 
issue here is how accurately we can estimate the true 
probability distributions of A and B from a finite num- 
ber of samples. The condition |[8j of the nonideal joint 
measurement is equivalent to the condition that 



(16) 



j=+-- 



holds for any p. In this case, probability distributions 
q a (i) is parameterized by true probability distributions 
p a (i), so that we can estimate p a (i) by using the method 
of the classical maximum-likelihood estimation except for 
the case of xa — or xb = 0. We assume that xa ^ 
and xb ^ in the following analysis. 

Suppose that we obtain outcome "+" N a (+) times and 
outcome "-" N a (-) times, where N a (+) + N a (-) = N. 
The likelihood function then becomes 



L a (p a (+))= N a (i)\nq a (i). 



(17) 



*=+, 



We denote as p* the maximum-likelihood estimator of 
p a {+) inferred from N measurement outcomes such that 



L a (Pa{+)) takes the maximum value with p a (+) = p* a 
under condition < p a {+) < 1. The accuracy of the es- 
timate can then be asymptotically characterized by the 
following theorem [24j, |25( : the distribution of p* a ap- 
proaches the normal distribution with average p a {+) and 
variance (iV/ Q ) _1 as the number of samples N increases 
with I a being the Fisher information. Note that this the- 
orem does not hold if p Q (+) = or 1. In other words, 
the maximum-likelihood estimator p* a asymptotically ap- 
proaches true value p a (+), the asymptotic behavior being 
characterized by Fisher information I a . The larger the 
Fisher information, the more information we can extract 
from the measurement outcome. In the special case of 
I a = 0, no information about the measured system can 
be obtained from measurement. Once we have found 
Pa{+), Pa(-) is obtained from p a (— ) = 1 —#«(+)• In 
the present situation, the Fisher information is given by 



E 

»=+,- 



d 2 mg a (z) 



X,, 



(18) 



q a (+)q a {- 



Note that in simultaneous measurement the Fisher infor- 
mation on observable A (B) is deteriorated by a factor 
of Xa(b) hi comparison with projection measurement. 

We thus conclude that the trade-off relation (TT2|) or 
(fT4"]) indicates that knowing about one of the probabil- 
ity distributions of the two noncommuting observables 
prevents us from knowing about the other because the 
number of available samples is finite; the upper bound 
on the estimation accuracy of the probability distribu- 
tions is severely restricted, unless the number of avail- 
able samples is infinite. Note that if N is infinite, we can 
accurately reconstruct both pA{i) and ps(i) from mea- 
sured probability distributions qA(i) and (?b(?); with in- 
finite samples, the noncommutability does not affect the 
accuracy of the estimates of probability distributions of 
noncommuting observables. 

We have considered the estimates of probability distri- 
butions Pa{}) and pb(J-) for simultaneous measurement 
by using the POVM which is made up by four positive op- 
erators. We now discuss a simple strategy for estimating 
the probability distributions of noncommuting observ- 
ables. We divide N identically prepared samples into two 
groups according to the ratio £ : 1 — £ (0 < £ < 1), and 
estimate pa{i) by measuring A for the first group using 
POVM {i?A(±)}, and similarly we estimate for the 

second group using POVM {_Eb(±)}. For a large num- 
ber of samples, this measurement is asymptotically de- 
scribed by POVM {£E A {±), (1-£)E B (±)}. In this case, 
the accuracy of the measurement is limited by Xa < £ 
and Xb < 1 — £, because the accuracy per sample is de- 
teriorated by a factor of £ and 1 — £, respectively. We 
thus obtain Xa + Xb < 1 (domain P in FIG.l), and 
therefore conclude that simultaneous measurement has 
an advantage over this simple method in that the do- 
main Xa + Xb > 1 for 9 ^ tt/2, i.e. domain Q in FIG.l 
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is accessible. 

It is worth pointing out that we can interpret the trade- 
off relations (fl~2"f and (fl4|) as uncertainty relations be- 
tween the measurement error and the back-action of the 
measurement @, [H, HH H3, HH [H, HI] . Let us suppose 
that p' is a state immediately after measurement of A for 
premeasurement state p. To identify the disturbance of 
B caused by A, we consider how much information about 
B for premeasurement state p is left in postmeasurement 
state p' . We characterize this by considering how much 
information on p can be obtained by measuring B for p' . 
The joint operation of measurement A being followed by 
measurement B can be described by a POVM {E(i j J]} 
corresponding to the measurement outcomes [281 ] - 

If the POVM fulfills the requirement that its marginal 
POVMs describe nonideal measurements of A and B, we 
again obtain inequality (fl~2|) . It is possible to interpret 
1 — Xb as a measure of the back-action of B caused by 
measurement of A. Defining the measurement error of A 
as Za = (1/Xa) — 1 and the back-action of the measure- 
ment on B as T>b = (1/Xb) — 1, we obtain 

£aDb > sin 2 9. (19) 

We should note that a non-selective measurement pro- 
cess for A can simulate the decoherence caused by the 
environment. In this case, trade-off relations (fT2"]) and 
(fl~9)) give a lower bound to the back-action of B in the 
presence of decoherence characterized by Xa ■ 

Note that if it were possible to perform error-free mea- 
surement of A without disturbing B, then we could pre- 
cisely measure both A and B by performing measurement 
A followed by measurement B. We can say that the for- 
mal similarity of two kinds of trade-off relations, (fL2"|) 
(or equivalently (fT4"]) ) and (|19[) . quantitatively represents 
the above-mentioned connection between the two kinds 
of uncertainty relations [l|| H(| : the uncertainty relation 
between the error of A and that of B, and the uncertainty 
relation between the error of A and the back-action of 
B. In the context of the maximum- likelihood estima- 
tion with finite samples, the number of samples required 
to achieve a given accuracy in estimating the probability 
distributions of both A and B is greater than the number 
for the case of estimating either A or B, because samples 
on which we perform precise measurements of A lose the 



information about the probability distribution of B, due 
to the back-action of measurement A. 

Finally, we mention the relevance of our work to pre- 
vious work in the case of a qubit system. Arthurs and 
Goodman [Til ] have discussed simultaneous measurement 
by assuming the unbiasedness for observables A and B. 
This condition implies that the arithmetic average of the 
measurement outcomes approaches to the true average 
in the limit of N — > oo. In the case of a qubit sys- 
tem, the unbiasedness condition implies Xa = Xb = 1 
because the probability distribution has a one-to-one cor- 
respondence with the average value. As discussed earlier, 
however, no simultaneous measurement satisfies this con- 
dition. We therefore cannot address simultaneous mea- 
surement by the method of Arthurs and Goodman for a 
qubit system. On the other hand, Andersson et al. [22| 
have relaxed the unbiasedness condition and adopted the 
condition that the arithmetic average of the measure- 
ment outcomes is proportional to the true average in 
the limit of N — + oo. This condition is satisfied if and 
only if r a = tb = 1/2. In this paper, we have consid- 
ered the most general class of measurements to which 
we can use the maximum-likelihood estimation to the 
marginal probability distributions, without assuming the 
unbiasedness condition. This class of measurements has 
been studied by de Muynck and Martens who have de- 
rived a trade-off relation in terms of the Shannon channel 
capacity (lol \v\. 

In conclusion, we have derived a trade-off relation con- 
cerning the accuracy of simultaneous measurement of two 
noncommuting observables for a qubit system. The re- 
lation gives an upper bound on our knowledge about 
noncommuting observables. Moreover, we have pointed 
out that the accuracy parameter Xa(b) is quantitatively 
related to the asymptotic accuracy of the maximum- 
likelihood estimation of the probability distribution of 
observable A(B). The generalization of our results to 
high-spin systems and continuous-variable systems mer- 
its further study. 
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